The stability of different aggregation techniques in ensemble feature selection
نویسندگان
چکیده
Abstract To mitigate the curse of dimensionality in high-dimensional datasets, feature selection has become a crucial step most data mining applications. However, no method consistently delivers best performance across different domains. For this reason and order to improve stability process, ensemble frameworks have increasingly popular. While many examined construction techniques under various considerations, little work been done shed light on influence aggregation process selection. In contribution field, aims explore impact some selected strategies ensemble’s accuracy. Using twelve classification real datasets from domains, accuracy five were four standard filter methods. The experimental analysis revealed significant differences both behavior aggregations, especially between score-based rank-based strategies. Moreover, it was observed that simpler based Arithmetic Mean or L2-norm appear be efficient compelling cases. Given structure associated application domain, work’s findings can guide ensembles using suitable rules.
منابع مشابه
Robust Feature Selection Using Ensemble Feature Selection Techniques
Robustness or stability of feature selection techniques is a topic of recent interest, and is an important issue when selected feature subsets are subsequently analysed by domain experts to gain more insight into the problem modelled. In this work, we investigate the use of ensemble feature selection techniques, where multiple feature selection methods are combined to yield more robust results....
متن کاملA Rank Aggregation Algorithm for Ensemble of Multiple Feature Selection Techniques in Credit Risk Evaluation
In credit risk evaluation the accuracy of a classifier is very significant for classifying the high-risk loan applicants correctly. Feature selection is one way of improving the accuracy of a classifier. It provides the classifier with important and relevant features for model development. This study uses the ensemble of multiple feature ranking techniques for feature selection of credit data. ...
متن کاملClassification Performance of Rank Aggregation Techniques for Ensemble Gene Selection
A very promising tool for data mining and bioinformatics is ensemble gene (feature) selection. Ensemble feature selection is the process of performing multiple runs of feature selection and then aggregating the results into a final ranked list. However, a central question of ensemble feature selection is how to aggregate the individual results into a single ranked feature list. There are a numb...
متن کاملStability of Feature Selection Algorithms and Ensemble Feature Selection Methods in Bioinformatics
1.
متن کاملMLIFT: Enhancing Multi-label Classifier with Ensemble Feature Selection
Multi-label classification has gained significant attention during recent years, due to the increasing number of modern applications associated with multi-label data. Despite its short life, different approaches have been presented to solve the task of multi-label classification. LIFT is a multi-label classifier which utilizes a new strategy to multi-label learning by leveraging label-specific ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Big Data
سال: 2022
ISSN: ['2196-1115']
DOI: https://doi.org/10.1186/s40537-022-00607-1